Pairwise Discriminative Speaker Verification in the 𝕀-Vector Space
نویسندگان
چکیده
This work presents a new and efficient approach to discriminative speaker verification in the i–vector space. We illustrate the development of a linear discriminative classifier that is trained to discriminate between the hypothesis that a pair of feature vectors in a trial belong to the same speaker or to different speakers. This approach is alternative to the usual discriminative setup that discriminates between a speaker and all the other speakers. We use a discriminative classifier based on a Support Vector Machine (SVM) that is trained to estimate the parameters of a symmetric quadratic function approximating a log–likelihood ratio score without explicit modeling of the i–vector distributions as in the generative Probabilistic Linear Discriminant Analysis (PLDA) models. Training these models is feasible because it is not necessary to expand the i–vector pairs, which would be expensive or even impossible even for medium sized training sets. The results of experiments performed on the tel-tel extended core condition of the NIST 2010 Speaker Recognition Evaluation are competitive with the ones obtained by generative models, in terms of normalized Detection Cost Function and Equal Error Rate. Moreover, we show that it is possible to train a gender–independent discriminative model that achieves state–of–the–art accuracy, comparable to the one of a gender–dependent system, saving memory and execution time both in training and in testing. Index Terms Speaker Recognition, I-vector, Discriminative training, Probabilistic Linear Discriminant Analysis, Support Vector Machines, Large–scale training.
منابع مشابه
SVMSVM: support vector machine speaker verification methodology
Support vector machines with the Fisher and score-space kernels are used for text independent speaker verification to provide direct discrimination between complete utterances. This is unlike approaches such as discriminatively trained Gaussian mixture models or other discriminative classifiers that discriminate at the frame-level only. Using the sequence-level discrimination approach we are ab...
متن کاملA discriminative method for speaker verification using the difference information
In this paper, a discriminative method is proposed for speaker verification. An utterance can be mapped into a matrix by computing the difference to a codebook, and then expand the mapped matrix to a vector as the input of support vector machines for speaker verification. The Gaussian mixture modelbased method is also constructed by utilizing its nature. The mapped vector indicates the utteranc...
متن کاملComparative Performance Analysis of SVM Speaker Verification System using Confusion Matrix
In Speaker verification task, it is necessary to calculate the performance of the speaker verification system; there are many systems available for the speaker verification task which uses the different type of modeling schemes like generative modeling and discriminative modeling. We are using discriminative modeling with the help of Support vector machine for speaker verification task. We prop...
متن کاملFusing Generatve and Discriminative Ubm-based Systems for Speaker Verification
In the past few years, discriminative approaches to perform speaker detection have shown good results and an increasing interest. Among these methods, SVM based systems have lots of advantages, especially their ability to deal with a high dimension feature space. Generative systems such as UBM-GMM systems show the greatest performance among other systems in speaker verification tasks. Combinati...
متن کاملSpeaker Identification and Verification Using Support Vector Machines and Sparse Kernel Logistic Regression
In this paper we investigate two discriminative classification approaches for frame-based speaker identification and verification, namely Support Vector Machine (SVM) and Sparse Kernel Logistic Regression (SKLR). SVMs have already shown good results in regression and classification in several fields of pattern recognition as well as in continuous speech recognition. While the non-probabilistic ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE Trans. Audio, Speech & Language Processing
دوره 21 شماره
صفحات -
تاریخ انتشار 2013